Speaker interpolation in HMM-based speech synthesis system
Abstract
This paper describes an approach to voice characteristics conversion for an HMM-based text-to-speech synthesis system using speaker interpolation. An HMM interpolation technique is derived from a probabilistic distance measure for HMMs and is used to synthesize speech with the characteristics of an untrained speaker by interpolating HMM parameters among the HMM sets of several representative speakers. The results of subjective experiments show that the characteristics of the synthesized speech change gradually from one speaker's to the other's as the interpolation ratio is changed.
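As a rough illustration of what interpolating HMM parameters among speakers can look like, the sketch below blends one state's output Gaussian across speakers with a set of interpolation ratios. It is a minimal sketch under stated assumptions, not the technique derived in the paper: it assumes diagonal-covariance Gaussians and a plain linear combination of means and variances, whereas the abstract says the paper's rule is derived from a probabilistic distance measure, which is not reproduced here. The function name `interpolate_gaussians` and the toy numbers are hypothetical.

```python
from typing import List, Sequence, Tuple

# One state output distribution: (mean vector, diagonal variances).
Gaussian = Tuple[List[float], List[float]]


def interpolate_gaussians(
    gaussians: Sequence[Gaussian], weights: Sequence[float]
) -> Gaussian:
    """Linearly blend diagonal Gaussians, one per representative speaker.

    Assumption: means and variances are each combined with the same
    non-negative weights that sum to 1. This is an illustrative rule only.
    """
    if not gaussians or len(gaussians) != len(weights):
        raise ValueError("need one weight per speaker Gaussian")
    if any(w < 0 for w in weights) or abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights must be non-negative and sum to 1")

    dim = len(gaussians[0][0])
    mean = [0.0] * dim
    var = [0.0] * dim
    for (mu, sigma2), w in zip(gaussians, weights):
        if len(mu) != dim or len(sigma2) != dim:
            raise ValueError("all Gaussians must share one dimensionality")
        for d in range(dim):
            mean[d] += w * mu[d]
            var[d] += w * sigma2[d]
    return mean, var


if __name__ == "__main__":
    # Toy two-dimensional parameters for two hypothetical speakers.
    speaker_a: Gaussian = ([1.0, 2.0], [0.5, 0.5])
    speaker_b: Gaussian = ([3.0, 6.0], [1.5, 0.5])

    # Sweep the interpolation ratio from speaker A towards speaker B.
    for step in range(5):
        ratio_b = step / 4
        blended = interpolate_gaussians(
            [speaker_a, speaker_b], [1.0 - ratio_b, ratio_b]
        )
        print(ratio_b, blended)
```

In a full system the same ratios would be applied to every state of the speaker-dependent HMM sets, so that sweeping the ratio moves the synthesized voice gradually between the representative speakers.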
Similar resources
Improvements of Hungarian Hidden Markov Model-based Text-to-Speech Synthesis
Statistical parametric, especially Hidden Markov Model-based, text-to-speech (TTS) synthesis has received much attention recently. The quality of HMM-based speech synthesis approaches that of the state-of-the-art unit selection systems and possesses numerous favorable features, e.g. small runtime footprint, speaker interpolation, speaker adaptation. This paper presents the improvements of a Hung...
Interpolation of Austrian German and Viennese Dialect/Sociolect in HMM-based Speech Synthesis
In contrast to widely used waveform concatenation methods, the presented approach to speech synthesis relies on a parametric analysis–re-synthesis technique, where the features extracted in the analysis stage are modeled by hidden Markov models (HMMs). Many important improvements in the last decade have helped this approach to reach impressive performance. Additionally, its inherent flexibility...
Speaker-Dependent Model Interpolation for Statistical Emotional Speech Synthesis
In this article, we propose a speaker-dependent model interpolation method for statistical emotional speech synthesis. The basic idea is to combine the neutral model set of the target speaker and an emotional model set selected from a pool of speakers. For model selection and interpolation weight determination, we propose to use a novel monophone-based Mahalanobis distance, which is a proper di...
Continuous Control of the Degree of Articulation in HMM-Based Speech Synthesis
This paper focuses on the implementation of a continuous control of the degree of articulation (hypo/hyperarticulation) in the framework of HMM-based speech synthesis. The adaptation of a neutral speech synthesizer to generate hypo and hyperarticulated speech using a limited amount of speech data is first studied. This is done using inter-speaker voice adaptation techniques, applied here to int...
A robust speaker verification system against imposture using an HMM-based speech synthesis system
This paper describes a text-prompted speaker verification system which is robust to imposture using synthetic speech generated by an HMM-based speech synthesis system. In the verification system, text and speaker are verified separately. Text verification is based on phoneme recognition using HMM, and speaker verification is based on GMM. To discriminate synthetic speech from natural speech, an...
Publication date: 1997